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Abstract 



A commonly used characteristic of statistical dependence of adjacency relations in real 
networks, the clustering coefficient, evaluates chances that two neighbours of a given vertex 
are adjacent. Another characteristic is obtained by considering conditional probabilities that 
two randomly chosen vertices are adjacent given that they have r common neighbours. We 
denote such probabilities cl{r) and call r — cl{r) the clustering function. We compare 
clustering functions of several networks having non- negligible clustering coefficient. They 
show similar patterns and surprising regularity. We also provide mathematically rigorous 
analysis of the clustering function of related random intersection graph models aimed at 
explaining the empirical results. 
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1 Introduction 

Our study is motivated by the question: given two vertices of a network, the presence of how 
many common neighbours would imply with certainty that these two vertices are adjacent. A 
"softer" question is about the probability that two vertices with (at least) r common neighbours 
establish a link. The answer is given by the clustering functions ([T]) and ([2]). 
Let Q = (V, 8) be a finite graph on the vertex set V and with the edge set £. The number of 
neighbours of a vertex v is denoted d{v). The number of common neighbours of vertices Vi and 
Vj is denoted d{vi, Vj). We are interested in the fraction of adjacent pairs Vi ~ Vj among all pairs 
{vi,Vj} C V having (at least) r common neighbours. Here and below '~' denotes the adjacency 
relation of Q. More formally, let us consider the random pair of distinct vertices {vl,V2} drawn 
from V uniformly at random. Define the clustering functions of G 



In the case of a social network ([T]), ([2]) could be interpreted as measures of social influence or 
pressure exercised by the neighbours on a pair of actors to establish a communication link. We 
remark that characteristics ([T]) and ^ are related to the clustering coefficient of Q. We recall its 
definition for convenience. Let {vl,V2,v^) be an ordered triple of distinct vertices drawn from V 
uniformly at random. The conditional probability that f J is adjacent to V2, given that vf and 
V2 are both adjacent to V3, is called the (global) clustering coefficient ([3], ^T6\, jlT], [21]). We 
denote it Cg = P(z;| ~ V2\vl ~ ^3,1^2 ~ vt)- 
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In this paper we study clustering functions first by considering empirical data and then by a 
rigorous analysis of related random graph models. 

We consider clustering function ([T]) of real networks admitting positive clustering coefficient: the 
actor network, where two actors are declared adjacent whenever they have acted in the same film 
([27]), and the Facebook network ([T], [T2], [251). We remark that empirical plots show similar 
pattern and surprising regularity of the clustering function r — )• clg{r) (see Sect. 2 below). 
Our choice of the random graph model is motivated by an observation of Newman et al. [18] that 
the clustering property of some social networks (so called affiliation networks) could be explained 
by the presence of a bipartite graph structure. For example, the bipartite graph, where actors are 
linked to films, defines the actor network. It seems reasonable that a bipartite graph structure 
might also be helpful in explaining (at least to some extent) the adjacency relations of Facebook 
network: two members become adjacent because they share some common interests/attributes. 
We secondly consider clustering function ([T]) of a random intersection graph, where vertices 
(actors) are prescribed attribute sets independently at random and two vertices are declared 
adjacent whenever they share at least one common attribute ([l5j, [13], see also [2], [H]). The 
random intersection graphs are relatively simple objects and for them rigorous mathematical 
results can be obtained. We evaluate the probabilities Y'{v\ ~ d{v\,V2) = r), r = 0, 1, 2, . . . 
for a random intersection graph in Sect. 3 below. Our theoretical results are then used to 
interpret empirical findings of Sect 2. 

2 Clustering functions: empirical results 

In Fig.l we plot clustering functions ([T]) and ([2]) of three drama actor networks: English actor 
network with n = 402622 actors, m = 66127 films and the clustering coefficient C = 0.32 
(clustering coefficient here and below is rounded up to 2 decimal places), French actor network 
with n = 43204 actors, m = 5629 films and the clustering coefficient C = 0.30 and Russian actor 
network with n = 9880 actors, m = 2459 films and C = 0.44. Data is obtained from ^J. In 
Fig. 2 we plot clustering function Q of three networks describing relations between community 
members at three different universities (data from [25j): the first network has n = 17425 vertices 
and the clustering coefficient C = 0.16 (• blue graph); the second network has n = 9414 vertices 
and the clustering coefficient C = 0.15 (* green graph); the third network has n = 6596 vertices 
and the clustering coefficient C = 0.16 (■ red graph). 

3 Clustering functions of random intersection graphs 

Vertices vi, . . . ,Vn of an intersection graph are represented by subsets Di, . . . , Dn of a given 
ground set W = {wi, . . . ,Wm}- Elements of W are called attributes or keys. Vertices Vi and 
Vj are declared adjacent if Did Dj 7^ 0. The adjacency relations of such an intersection graph 
resemble that of some real networks, e.g., the collaboration network, where authors are declared 
adjacent whenever they have coauthored a paper, or the actor network, where two actors are 
linked by an edge whenever they have acted in the same film. Random intersection graph have 
attracted considerable attention in the recent literature, see, e.g., [1], [5], [7] [9], [11], [20], [19], 
|26| . They admit a power law degree distribution and tunable clustering. We consider two 
models of random intesection graphs: the active graph and the inhomogeneous graph. 
Active graph. In the active random intersection graph Gi{n,m, P) every vertex Vi £ V = 
{vi, . . . ,Vn} selects its attribute set Di independently at random ([l3], [IS]). Here we assume 
for simplicity that independent random sets Di, . . . , Dn have the same probability distribution 
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Figure 1: Clustering functions for tliree actor networks: (a) d{r), (b) Cl{r). 
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Figure 2: Clustering functions of three university networks. 

such that all attributes have equal probabilities to be selected. In particular, we have 

P(A = ^) = (|5''P(|A|), for any AcW. (3) 

Here P is the common probability distribution of the sizes of selected sets Xi := \Di\ (for each 
i = l,...,n we have P(Xi = k) = P{k), k = 0,1,... m). We remark that Xi, . . . , Xn are 
independent random variables taking values in {0, 1, ... , m}. 
Here we study the clustering function 

r — )• cl{r) = P{vl ~ 112! d{vl, V2) = r) = P(wi ~ V2\ ^(^1,^2) = r) (4) 

of a sparse random intersection graph with large number of vertices (by sparse we mean that the 
number of edges scales as the number of vertices n as n — t- +00). It is convenient to consider a 
sequence of random intersection graphs {G(„)}n, where = Gi(n, m, P) and where m = run 
and P = Pn both depend on n. We remark that is a sequence of sparse random graphs 

whenever the size Xi of the typical random set scales as {m/nY/'^ as m, n — t- 00 ([6]). Assuming, 
in addition, that 

(i) Xi^Jrifrn converges in distribution to some random variable Z\ 

(ii) EZ < 00 and EXi ^njm converges to EZ 
one obtains the asymptotic degree distribution of 

lim P(d(z;i)) = Ee~^i^(ziZ)VA;!, for A; = 0,1,..., (5) 

n— >+oo 

see [6j, [7], [To], [22]. Here d{v) denotes the degree of a vertex v and Zi denotes the z— th moment 
of Z, Zi = EZ*. Along with the first moment condition (ii) we shall also consider the r— th 
moment condition 

(ii-r) EZ^ < 00 and E(Xi y/njmf converges to EZ''. 

We remark that the adjacency relations in a random intersection graph are statistically depen- 
dent events. In particular, the clustering coefficient a = a{G(^n)) = P(^i ~ V2\vi ~ V3,V2 ~ fa) 
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of a sparse random intersection graph G(^n) is bounded away from zero as n— )■ +00 provided that 
the second moment of the degree distribution is finite and the ratio /3„ = m/n is bounded (0, 
[TU]). More precisely, assuming that (i) and (ii-2) hold one obtains as n — )■ +00 



a 



fi-'/X^\62-S,)-' + o{l). (6) 



Here 6r denotes the r— th moment of the asymptotic degree distribution. That is, we write 
6r = Ed^ for a random variable d^: having the distribution P((i* = k) = ^e~^^^ (ziZ)^ /kl, k = 
0,1,.... From ^ we see that /3n +00 implies a = o(l). For comparison, the (unconditional) 
edge probability pe = P{vi ~ ^2) = 5in^^ + o(n~^) is of order 0{n~^) no matter whether /3„ is 
bounded or not ([7J). 

Theorems [1] and [2] show a first order asymptotics as n — )• +00 of c/(r) in the cases where (3^ is 
bounded and {3n +00, respectively. 

Theorem 1. Let m,n — >• 00. Assume that (i), (ii-2) hold. Suppose that f3n ^ P £ (0,+oo). 
Denote A = y^(5i//3. We have 



cl{r) 



p-ie-^(l + o(l)), r = 0; 



a 



ll-o(l), r>2. 



Here pe = n + o{n ^) denotes the edge probability, pe = P{vi ^ V2). 

We remark that for r > 2 the convergence to 1 in ((7j) can be quite slow, especially in the 
cases where the (asymptotic) average degree 61 is large. This can be seen from a more detailed 
expression for cl{r), r > 2, which is obtained from the proof of Theorem [T| 

dir) = {l+p-^f-\A)EMA')y' + Oin-^). (8) 

Here A' = n~^{z2 — A)ZiZ2 and /r(A) = e~^X'^ /r\ denotes the Poisson probability. We note that 
when A is large, the probability fr{A) is small for r << A and, in this case, cl{r) may deviate 
substantially from 1 even for comparatively large values of n, see also Fig. 4 and 5 below. 

Theorem 2. Let m,n ^ 00. Assume that (i), (ii-2) hold. Suppose that fin +00. We have 

P + r = 0; 

la(l + o(l)) + 0(ii-i), r=l. ^' 



In particular, c/(0) = 0{n ^) and c/(l) = o(l), see Furthermore, we have 
d{2) 



f 1 + 0(1), for /3n/n^0; 

i+ff,(g,i3,)2g-4 +0(1), for e (0,+oo); (10) 

,0(1), for (3n/n^+(x>. 



Assuming, in addition, that (5^ = o{n) for each k = 1,2,3 ... , we obtain 

d(r) = 1 + 0(1), for r = 2,3,.... (11) 



5 



1.0 



_ 0.6 



0.2 













1 

: ' 

-" ' 










: ' 
: ' 
: ' 
f 

: 1 










1 

; 1 

: ■ ' 
: : 1 
: 1 










: ■ ' 
: ; 1 
: :l 
:l 










t 

i 

: 

/ ; 

/ i — - 








— n: 

- ■ n= 

.... n: 


=m=1000 
=m = 10000 
=m = 100000 



Figure 3: Convergence to the step function for random intersection graphs with all sets of size 10. 



We conclude from ([T]), ^ that the edge dependence measures cl{l) and a are, in fact, equivalent 
in the case of a random intersection graph. In particular, we have c/(l) = 1 — o(l) 44> a = 1 — o(l) 
and c/(l) = o(l) 44> a = o(l). Furthermore, (10) tels us that the parameter cl{2) is able to 
distinguish between the cases /3„ = o{n) and n = o{f3n)- Finally, (11) tels us that any c(r), 
r = 1,2,... can't distinguish between sequences and {/3^} growing slower that any power 
of n (take /3„ = Inn and = In^ n, for example). 

Remark 1. It is likely that (fToI) can be extended to an arbitrary r as follows 



cl{r) 



fl + o(l), 
c(r, /3*) + o(l), 



for /3„/n^-2'^"' 
for /3„/n4-2'^~' 
for /S^/n^-^*-"' 



0; 

/3* G (0, 

+00. 



-oo 



Here c(r, ;9* 



(A 



r/2 -r-2 



+ 1) 



We note that numbers Zi = "EiZ"^ can be expressed in 



terms of moments of the asymptotic degree distribution ([s]). 
Proofs of Theorems [T] and [2] are given in the Appendix. 

Fig. 3 illustrates the convergence to a step function shown by Theorem[T] Here we plot clustering 



mi 



3c:i— 1 



function ([ij) of random intersection graphs Gi = G{ni,mi, P), where rii 
1,2,3, and P(10) = 1. 

Fig. 4 illustrates the influence of the expected degree 5i on the slope of the clustering function 
^ : the lar ger is 6i the more gradual is the slope. In Fig. 4 we plot ([T]) for random intersection 
graphs Gi = G{n, m, Pi), where n = m = 10^ and -Pi(3*) = 1, i = 1, 2, 3. 

Inhomogeneous graph. The inhomogeneous random intersection graph Gi{n,m, Pi, P2) on 
the vertex set V = {vi, . . . ,Vn} is obtained as follows. We first generate independent random 
variables Ai, . . . , An ,Bi,..., Bm such that each Ai has the probability distribution Pi and each 
Bj has the probability distribution P2. Then, conditionally on the realized values {Ai, Bj}^'J!^^, 

we include the attribute wj £ W in the set Di with probability pij = mm{l, AiBj{nm)~^/^} 
independently for each i and j (see [2], [H], [2T]). We consider a sequence of inhomogeneous 
intersection graphs where Pi, P2 remain fixed and m = nin and n tend to infinity. We remark 
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Figure 4: Clustering function of random intersection graphs with n = m = 10000 and Pi (3') = 1, 
i = 1,2,3. 



that Gn = Gi{n,m, Pi, P2) is sparse and the edge probability pe := P{vi ~ V2) = af62n~^ + 
o(n~^). Here we denote = EA^ and bk = EB^. 

Our next result shows that Gn admits a nonvanishing clustering coefficient a = a{Gn) = P(^'i ~ 
V2\vi ~ V3,V2 ~ V3) in the case where = m/n is bounded and it is bounded away from zero 
as n, m — )• +00. In addition, we show a first order asymptotics of the clustering function c/(-). 

Theorem 3. Let m,n ^ 00. Assume that < EAf < 00 and < EBf < 00. Suppose that 
f^n ^ P & (0, +00). Then we have 



a 



63^ + 0.5^^ 



+ 0(1) 



and 



cl{r) 



a26*n-i(l + o(l)), 



r = 0; 
r = 1; 
r > 2. 



(12) 



(13) 



,1-0(1), 

Here k = aia^^"^ and h*^ = ES'le'''^^^/^ . 

The proof of Theorem|3]is given in the Appendix. We remark that the approach used in the proof 
applies to the case of /3„ — )• +00 as well. In particular, for /3„ — )• +00, we have d(0) = pe(l + o(l)) 
and a = o(l), c/(l) = o(l). One can show, in addition, that 



cm 



'1 + 0(1), 
fe4K'+0.25,9 + °(1)' 

U(i), 



for 
for 
for 



0; 

/3g (0,+oo); 
+00. 



(14) 



Here k' = 03 ^■ 
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Figure 5: Sampling subgraphs with degree constraints. 



The first order asymptotics ([T]), ([To]), (11) and (13) suggests that the clustering function cZ(-) of 
a (very) large affiliation network can be approximated by a step - like function. Furthermore, 
c/(l) is closely related to the clustering coefficient. 

Simulations in Fig. 3 and 4 show that the convergence in ([T]) can be rather slow and we observe 
a sigmoid function approximation of the step function. Furthermore, the larger is the average 
degree, the more remote is the "step" from the origin and the more gradual is the slope of the 
clustering function. 

In order to learn about the influence of the inhomogeneity of the degree sequence on the slope 
of the clustering function r — )• c/(r) we select various subnetworks of real networks according 
to certain regularity conditions satisfied by their degree sequences. We observe that the inho- 
mogeneity (heavy tail) of the degree sequence affects the slope of the clustering function: the 
heavier the tail the more gradual is the slope of the clustering function. We illustrate this 
observations in Fig. 5 and 6. 

Fig. 5 plots clustering function ([T]) of subgraphs of the first university network (see Sect 2.) 
sampled as follows. Qi is the subgraph that includes all vertices of degree not larger than 50. 
It has no = 7165 vertices. Q2 is a subgraph induced by no vertices drawn uniformly at random 
(without replacement) from the vertices of degree not larger than 150. Q3 is a subgraph of 
induced by no vertices drawn uniformly at random (without replacement) from the set of all 
vertices. Now all three graphs have the same number of vertices. 

In Fig. 6 we plot two subgraphs of French actor network (data from |27j). The subgraph ^4 
is induced by the set of marked vertices obtained as follows: we put a mark on each vertex v 
with probability d~'^{v) and independently of the other vertices. Choosing r = 0.5 we obtain a 
random subgraph denoted ^4. In our case the realized number of marked vertices ni = 8871. 
^5 is the subgraph of French actor network induced by ni vertices drawn uniformly at random 
(without replacement) from the set of all vertices. Now both subgraphs have the same number 
of vertices, but the degree sequence of is much more regular than that of Q^. 
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Figure 6: Sampling a subgraph with a random degree constraint. 

Finally, we examine how well a random intersection graph fits the real data. For this purpose we 
consider a memoryless actor network obtained as follows. Assume every actor of a given actor 
graph has forgotten about the titles of movies he or she acted in, but remembers the number of 
movies. 

We first simulate an instance of the active memoryless graph where each actor chooses films 
independently and uniformly at random from a given set of fh films so that the number of films 
chosen by each actor is the same as in the true actor graph. In the active memoryless graph 
all films have equal chances to be selected by any of actors. We remark that in the case where 
rh = m, i.e., the number of films in the active memoryless graph is the same as in the real 
underlying actor network, the expected degree of the memoryless graph does not match the 
average degree of the real network. We can easily adjust the number of films (of the memoryless 
graph) so that these degrees match. We denote this number m' and call the active memoryless 
graph with rh = m' adjusted one. In Fig. 7. we plot clustering function ([T]) of two instances of 
memoryless graphs for comparison with the underlying French actor network: one with the true 
number of films and another with the adjusted number of films. 

We secondly simulate an instance of the inhomogeneous memoryless graph where an actor Vi 
chooses the film wj with probability aibjM~^ independently for each i and j. Here the numbers 
Oj, bj are observed characteristics of the underlying actor network: Vi acted in Oj films; bj actors 
acted in the film Wj. M = Yli<i<n ~ Yli<j<m total number of links of the bipartite 

graph where actors are linked to films. In Fig. 8 we plot clustering function ([T]) of an instance of 
the inhomogeneous memoryless graph of the French actor network. Here we observe a remarkable 
accuracy of the approximation of the real clustering function by that of the memoryless graph. 

5 Appendix 

The section is organized as follows: we first collect some notation, then we prove Theorems [T| [2] 
and [3} At the very end we present two auxiliary results used in proofs: Lemmas [2] and [3| 
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Figure 7: Real French actor network and two simulated active memoryless networks. 



1.0 



0.8 



0.4 



0.2 



0.0, 

























-.."'•■•••*.' 

** * 






?^ * * 
* * 






* 

< 
















*• 

i 

* 

** 
















* 














* 
* 
* 










• • Ir 

* * R 


homogene 
eal 


ous 



20 40 60 80 100 120 140 



r 



Figure 8: Real French actor network and a simulated inhomogeneous memoryless network. 



10 



— 1/2 

By Xni we denote the size of the set Di in Furthermore, we denote Zni = f3n X^i and 

Zqi := Z. Introduce the function 

t ip{t) = supEZ^iI|^^^>t}. 

n>0 

We remark that conditions (i), (ii-2) imply = o(l) as i — )■ +00 (see [7]). By P and E we 
denote the conditional probability and the conditional expectation given Di, D2- We introduce 
events A = {vi ~ ^2}, Ai = {\Di n D2I = i} and probabilities Pi{r) = P{Ai fl {di2 = r}). By 
/r(A) = e~'^X^ /r\ we denote the Poisson probability. 

Proof of Theorems [7] and We have 

.,(.) = P(.4|.. = .) = ^'y,;f!;;» (15) 

In order to evaluate the numerator we write A = L)i>iAi and apply the total probability formula 
P{An{di2 = r}) = ^Pi{r)= pi{r)+Rk{r). (16) 

i>l l<i<k 

Here Rk{r) = X^j>fcPi(?') < P(l^i -^2! > k + 1). Similarly we obtain 

P(di2 =r) = Y,Pi{r) = Pii^) + Mr)- (17) 

j>0 0<j<A: 

In order to prove Theorem [l] we choose k = 1 in (16), (17) and invoke the asymptotic expressions 

of Pi{r) and the upper bound for P(|-Di H I?2| > ^ + 1) shown in Lemma[l} 

Theorem [2] is obtained in the same way, but now we choose k = 2. □ 

— 1/2 

Lemma 1. A ssume that Pn — y j3 G (0, +00]. Suppose that (i), (ii-2) hold. Denote Ai — j3n z\ 

— 1/2 

and A2 = Z2 — f3n z\. We have as n ^ +00 

Po(0) = 1-0(1), p^{r) = o{n~^), r>3, (18) 

p^{r)=n-'-{r\)-^K^24 + o{n-^), r = 1,2, (19) 

pi{r)=n-^zlfr{Ki)+Tr, r>0 (20) 

P2(r) = 2-in-2/r(2Ai)A2 + rs. r > 0. (21) 

Here Tr = 0{n~'^), for r = 0, 1, and Tr = 0{n^^/3n ^^^) + o(n~^), for r > 2. Furthermore, 

P{\DinD2\>3) = o{n''^) and P{\Di D D2\ > k) = 0{n~^), k = l,2. (22) 

Proof of Lemma\I\ Before the proof we introduce some notation. By Pj(-) = P{-\Ai, Di, D2) 
we denote the conditional probability given Ai and Di , D2 ■ By Ig we denote the indicator of an 
event B and write Ig = 1 — I^j. In the proof we use several indicators 

= ^Xj<em}, Ij = I{Xj.<mV2rt-i/4}, l*j = I{Xj<0.5m}- 

Some of them depend on e > 0, value of which will be clear from the context. We will use the 
following simple properties of the Poisson probability A — )■ /r(A). It follows from the mean value 
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theorem fr{t) — /r(s) = /r(0(^ ~ •s)) where < s < ^ < and the inequahties 1/^(01 ^ 1 ^.nd 

<ethat 



\fris)- fr{t)\<\s-t\ and \f2+ris)- f2+r{t)\<{s + t)\s-t\. 

Now we outline the proof. In order to evaluate Pi{r) we write 

Piir) = EP(A n {di2 = r}) = Epi(r)P(A), 



(23) 



(24) 



where Pi{r) = Pi(di2 = r). Next we approximate P{Ai) using (73) and apply the Poisson 
approximation to Pi{r). We observe that, given Ai, Di,D2, the random variable 



d 



12 



3<j<n 



{vi^Vj}^{v2^Vj} 



has binomial distribution Bin(n — 2, qi), where qi is the probability that -D3 intersects with both 
sets Di and D2. Hence, Pi{r) can be approximated by the Poisson probability /r(Ai), where 
Aj = (n — 2)gj. Finally, we approximate Aj by Aj = niji and fr{K) by fr{\)- Here 



go = n ^(2:2 - /?„ ^/^zi)ZiZ2, = n ^^^2:1, q2 = 2n ^/3„ ^/^zi 



(25) 



are approximations of qi , q2 and §3 respectively. In order to obtain an upper bound for the error 
of such approximation we write 



A.,. := Piir) - frCXi) = K,i + Ki^ 
where • = pi{r) - fr{\i) and A" - = fr{Xi) - /r(At), and estimate 

\Ar,i\<2nqf, i, r = 0, 1, 2, . . . , 



(26) 



(27) 



using LeCam's lemma (see Lemmaj2|, and estimate A" • using the mean value theorem, see (23). 
Now we briefly explain approximations (25). Let {wl, . . . ,w*} denote the intersection Di fTT)2 
provided it is non empty. We write q^ = Pi(ni > 1, n2 > 1), where Uj = \D3r1 Dj\, j = 1,2, and 
split 

go = 901 +902, 9i = gii + gi2, 92 = 921 + ^22 + 923 + 924, 



where 



901 = Poim = l,n2 = 1), 902 = Po('^i + ^2 > 3,ni > l,n2 > 1), 

qii = Piiwl G D3), qi2 = Piiwl ^ D^,ni > l,n2 > 1), 

921 = P2{wl G D^,W*2 i D3), q22 = P2{wl i D3,W2 G D3), 

923 = P2{'WI,W2 G D3), 524 = Pl{wl,W2 ^ D3, "1 > l,^^2 > 1) 



(28) 



Proof of (18), (19). In order to prove (|18[), (19) we show that 



Approximations qi w qi, see (25), are obtained as follows. We have qo k. goi ~ 9o, 9i ~ 9ii = 9i 
and q2 ~ 921 + 922 ~ 92- 



(29) 

r = 0,l,2, (30) 

(31) 



EA,,oP(A) = o{n-''''), r > 
E/,(Ao)P(A) = (r!)-^EA5 + o(n 
E/,(Ao)P(A) = o{n-^), r > 3. 



-rA2\ 
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We firstly prove (29). In the case where /3 < oo we find no > such that (3 < 2/3„ for n > no. 
In the case where /3 = +c« we find no such that j3n > 1 for n > no- In order to prove (29) we 
show that for any < e < min{0.5/3^/^, 0.1} and n > ng we have 



E|A,,o| < c^n-' + c^n-''^^Ri{e) + 4n-^e-*/?2(e 



(32) 



where 



Ri{e) = 2ip{£-^) + 4z2(e + m"^ + n'^) 



and 



R2i£) = ipien){l + 4e-^n-2z2). 



Here and below denotes a constant independent of n, m and e. Observe that limt_j.+oo ^{t) = 
implies that the left-hand side of (32 ) is o(n~^^^). Now we fix e and prove (32 ). For this purpose 
we write A^^o = ^r,o^ + ^r.ol and show the inequalities 



B\ArflI\ < c*n-3 + c*n-^^2i?i(e), 
E|A^,oI| < El < 4n-2e-^i?2(e). 



(33) 
(34) 



The first inequality of (34) is obvious. In order to prove the second one we combine the inequalies 

e^n^El < Pn^E{Xi + X2fl < 2p-^E{Xf + X|)I = A^-^BX^I, 

which follow from Markov's inequality, with the inequalities 

p-^-EXfl < /3^^EX2(Ii + I2) < /3-^EX2(Ii + 4e-^n-^p-^Xlh) < i?2(e). 

Here we applied the inequality I < Ii + 12 and then Markov's inequality. 

In order to prove (33) we write A^^ol = ^^^'^01 + ^ro^' (26), and invoke the inequalities 

-3 



and 



(35) 



E\A%\I<c,n-''''Ri{e). 
The first inequality of (35) follows from (27) and the inequalities Qq < 2qQ + 2(go — qo)'^ and 

kol - < n-^m-'^XiX2Riie). (36) 



The second inequality of ( 35 ) follows from ^3 ) and ( 36 ) 
We complete the proof of (29) by showing a36 



We note that for n > no we have 



1/2 



< 1. 



In particular, the inequality Xi + X2 < e^njij implies Xi + X2 < em. We shall show that 



(l-3m-i)go < qoi < (1 + 2e)^o, 
qo2 < 2n-^m-^XiX2{ipie-^) + 2ez2) 



(37) 
(38) 



These inequalities imply (36). In order to prove (37) we write qoi = E^q^^, where ggi = Po(^i 
l,n2 = 11^3), and show that 



xiX2(l - 3m -'^) < Qq]^ < xiX2(l + 2e), 
where xi = m~^XiX3 and X2 = m~^X2(X3 — 1). We have 

Qoi = TiT'i, n = Po(ni = 11X3), T2 = Po(n2 = l|ni = 1,^3) 



(39) 
(40) 



Combining the inequalities, which follow from (73), 



xi(l — (m — Xi) ^) < Ti < xi, 

X2(l -{m-Xi- X2y^) <T2< X2m{m - Xi)-^ 
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with the inequality Xi + X2 < em we obtain ( 39 ) . 

Now we prove (38). We spht go2 = Qoa + Qoi, where gos = Po{ni > 2,n2 > 1) and go4 = PoC't-i ^ 
l,n2 > 2) and construct upper bounds for go3 and go4- Both quantities are estimated in the 
same way. We only consider qo3- We denote pk* = P(rei > k,n2 > Ij-^a), k = 1,2, and write 



qo3 = Ep2* = Ep2*(II*3 + 1*3) < Ep2*II*3 + 'Epulis. 



(41) 



Here we use the i neq uality p2* < Pi*- Next, proceeding as in the proof of the right hand 
side inequality of (39) we obtain pu < xiX2(l + 2e) < 2xiX2. Hence, we have Epulis < 
2n^^m~^ XiX2ip{e^^. In order to estimate Ep2*I*3 we write 

P2*=Tl*T2*, = Po(ni = 21X3), T2* = Po(n2 = l|ni = 2,^3) 

and invoke the inequalities, which follow from ( |73| ), 

Tu < 2-^m-^Xfxl rs* < (m - Xi)-^X2X3. 

We note that Xi < X1+X2 < e^njin'^ < em imply ri*r2=i, < e'^nPn^'^m~^XiX2X^. Furthermore, 
using the inequality X3I=k3 < e"^/?^^ we write 

Ep2*I*3 < ETi*r2*I*3 < e/3„nm"^XiX2EX|. 



Invoking in (41) the upper bounds for Ep2*I*3 and Epi*I*3 we obtain (38). Proof of (36) is 
complete. 



We secondly prove ( |30| ) . For this purpose we write 

/,(Ao)P(A) = (r!)-^ASP(A) + Roi = {rir% + i2oi + ^02- (42) 
Here in the first step we apply the inequality 1 — e~'^° < Aq and in the second step we apply the 

(43) 



inquality, see (73), 



1 - P{Ao) = P{Di r\D2^^)< XiX2m-^. 



using the simple bounds E|i?oi| = 0{n ^ ), i = 1,2. In the case where r 
truncation argument. It follows from the inequalities 



Hence, |i?oi| < >H) and |i?02| < X^oXiX2m-^. Now, for r = 0, 1, we obtain ^ from (|42j) 

2 we invoke the 

(44) 

(45) 

for j = 3, 4. 



that 



I1I2 < 1 < I1I2 + Ii + I2 



/2(Ao)P(A) = Ill2/2(Ao)P(A) +ii03, 



Aq = I1I2A0 + Ro4, 



where |i?oj| ^ ^o(-'-i + -'-2) are negligibly small, i.e., E|i?oj| ^ c*n '^(f{n^/^) = o{n 



Now combining ( |42[ ) and ( 45 ) , and invoking the bounds 
E|i2ojIil2| < c^'E{XiX2m~^flil2 < c^n'^ ^^'E{XiX2 



m 



'^f = 0{n-''% J = 1,2 



we obtain (30) for r = 2. 
Let us prove (31). We write 

E/,(Ao)P(A) < E/,(Ao) < E/,(Ao)(Iil2 + I1 +I2) 
and apply the inequalities /r(t) < fr~j{t) <t^,0<j<r. For r > 3 we obtain 

E^(Ao)Iil2 < EAi]lil2 < c,n-^/^B~Xl = Oin-^^^), 
E/.(Ao)(Ii +I2) < BXlih+h) < c,n-V(n^/') = oin-^). 
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Proof of (20), (21). Given a sequence of random variables {Yn\ and r > we write Yn ~< Tr 



to denote the fact that E|y„| = 0(n-^), for r e {0, 1}, and E|y„| = 0(n-^/3„ ^''^) + o{n 
r > 2. In the proof we use the fohowing inequahties, which are obtained from (|44|) and (73), 



for 



P(A) < ^t, 

P{Ai) < P(A)(Iil2 + Ii + I2) < Xilil2 + Ii + I2 < + I, + I2. 



(46) 
(47) 



Here x, = {Xi)i{X2)i/{il{m)i) and i = 1,2. We remark that (20), (21) follows from the bounds 



/r.(Ai)P(A) - fr{\)>Ci < Tr\/i, 



1,2. 



(48) 
(49) 



Let us show (48). For this purpose we write 



a; ,1,1 + a;',i,i + A. 



and invoke the bounds 

A;^,I*iP(A) ^ Trvi, A'^I*iP(A) ^ Trvi 



A^,iI*lP(A) -< T2. 



(50) 



We note that the third bound of (50) follows by Markov's inequality 

EA,.iI=,iP(A) < EI=,i < (4nm)"V(0.5V^) = o{n'^). 



Next we establish the first and second bound of (50) in the case where i = 1. The first bound 
of is obtained from (27) using the simple inequality qf < 2qf^ + 2g^2 (I'ecall that qn = qi) and 



the inequality, which is shown below. 



(7121*1 < 2n ^Z2XiX2. 



(51) 



More precisely, we note that (51) implies 'Einqi2^*i = 0{n ^) and proceed as follows 
A;iI=,iP(^i) < 2nqll^iP{Ai) < AnqlP{Ai) + 4ng?2l*i ^ -^r- 



In the last step we used the inequality, see (46), 'Einq\P{Ai) < c*n ^/3„^. 



In the proof of the second bound of (50) we combine (23), ( |5l| and apply the simple inequalities 

|Ai - All < 2gi + nqi2, Ai + Ai < 2nqi + nqi2. 
In particular, for r = 0, 1 we have 

EA;'^iI*iP(^i) < E|Ai - Ai|I*iP(^i) < 2EgiP(^i) + Engi2l*iP(^i) -< r^. 
In the last step we invoked the bounds Eq^iP(^i) = 0(n~^) and En(7i2l*iP(.4i) = 0(n~^), 



which follow from (46), (51). For r > 2 we have 



A;'^iI,iP(^i) < |Ai-Ai|(Ai + Ai)I,iP(^i) 

< 4(ng? + v?qiqi2 + ql2)l*iP {Ai 



Here in the last step we invoked the bounds, which follow from (|46j), (51), 

EnqlP{Ai) = 0{n~^p-^), En" qiqi2l*iP {Ai) = 0{n~^ p-^'^) 
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and the inequalities, see (51), (47), 

En\?2l*iPMi) < En2g?2l*i(?^"^^^ + Ii +I2) < c^n'^/^ ^ c^n" 



the inequahties, see (73), 



We complete the proof of (50), in the case where i = 1, by showing (51). We obtain (51) from 



912 < Pi(m > l,n2 > 1) = EPi(ni > l,n2 > Ij^g) = Et[t^, 



Pi(ni > 11X3) < m-^XiXs, 

Pi(n2 > l|ni > 1,^3) < {m-Xi)-^X2X^. 



Now we establish the first two bounds of (50) in the case where i = 2. The first bound is 
obtained from (27) using the simple inequality g| < 2^21 + 2(7^2 + 2^23 + 2^24 a-i^d the upper 



bounds for g2j, J = 1, 2, 3, 4 shown below. In particular, we have ^21 = '722 and 



E- 



m — I 



q22 = EP2(«;^ G D^\wl i L»3,^3)PK i ^3,^3 

where I-R21I < c*n~^(/3„ '^''^ + /3^"^). Furthermore, we have, see (73) 
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+ R: 



■21, 



0^23 = E(X3)2/(m)2 < c=^n' 
9241*1 < c^{nmy^ X1X2. 



(52) 



(53) 
(54) 



Here (54) is shown in the same way as (^51| above. From (^27^, (46) we obtain 



A'^ 2P('^2)I*i < 2nqlP{A2)l*i < ^n{(^^ + + 923)>^2 + ^nq^J^i -< T2. 



In the last step we used the bounds, which follow from (52), (53) and (54), 

^^(921 + 922 + 923)^^2 ^ T2 and ^ T2. 



Let us show the second bound of ( 50 ) . Since the absolute value of 



A2 - A2 = (n - 2)(g23 + 924 + (921 + 922 - 92)) - 292 



is bounded from above by n(g23 + 924 + 2i?2i) + 292, see (53), we obtain from (23) that 

A';2P(^2)I*l < ri(923 + 2i?2l)>^2 + 292X2 + ng24l*l><2''^("~^ + II + I2) ^ T2. 



Here in the first step we applied (46) and the inequality (P(^2)) < X2(^ + Ii + 12)1 which 



follows from (46) and (47). In the second step we used (52), i53v, (54) 



Finally, we prove (49). In the case where i = 1 it suffices to show that 

< E\m-^XiX2 - P{Ai)\ < c,n-2(l + /3-1/2), 



We derive (55) from the inequalities 

m~^XiX2 > P(A) > P(^i)I*i > m~^XiX2{l - Ru). 

Here 

-Ru = 1*1 + (m - Xi)-iXiX2l*i < 2m-^Xi + 2m-^XiX2. 



(55) 



(56) 
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We note that the first and third inequality of (56) follow from (73). Indeed (73) implies 



and we have (1 — (m — X\)^^ X\X2)\it\ = 1 — -Rn- 

Now we prove ( |49[ ) in the case where i = 2. For this purpose we show that for every < e < 0.5 

(57) 



E|x2 - P(^2)| < eEx2 + c^n ^^(e^). 



We note that (57) follows from the inequalities 



^2 > P(^2) > P(^2)I*lI^2 > X2 1 



X\X2 



*1 



m — X\ 

We obtain these inequalities combining the following ones 

1*11*2 < 1 < I*ll*2 +1*1 +1*2, 
X2 > P(^2) > X2 ( 1 ^^^^ 



II^2>x2(l-e-I.i-I^2). 



m — X\ 



Here the first line is obvious, the second line follows from (73). 



Proof of (22). In the proof we apply (73). We have, see (46), 

P(|L»i n D2\ >k)<Xk< Z'lZ^n-''. 



(58) 



Taking the expected values in (58) we obtain (22) for k = 1,2. For = 3 we write, see (44) 
P(|L>i n L»2| > 3) < + + I2 and 

P{\Di n Dal > 3) = p2/3(|Z)i n D2I > 3)P^/^(|Di n L>2| > 3) < xy^{n-'-^/'^ + Ii + 12)^/^ 



Hence, P(|Z)i n L>2| > 3) < ¥.Kl'^{n-^'^ + Ii + 12)^/^ = o(n-2). 
Proof of Theorem^ Given WjjWk, wi £ W we denote the events 



□ 



Tij = {wj G DinD2}, 

Ujk = {wj £Din D3, G L>2 n D3}, 

y-jki = {wj eDiD D2, WkGDiD D3, wiGD2n D^} 



and their unions 



U 



U 

{j,k}c[m] 



jeH {j,k}C[m\ {j,k,l}c[m] 

Furthermore, we denote Ti^ = Ujg[^„i]?^j and V.'^ = ^{j^k}c[m-i]T~(-jk- 



Let us prove (13). To this aim we write cl{r) = p*/{p* + p*), where 

p*{r) := P{vi ~ V2,di2 = r), p* := P(t;i 9^ V2,di2 = r). 



and show that 



p; = Ef^{r)alBln-' + o{n-'), 
p;=fM{r) + 0{n--^). 



(59) 
(60) 
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Here we denote A = aiBmPn ^^"^ and Ai = 2 ^a\a2^n ^ ^ and r,. = 1 + I{r>i}- 

We first show (59). Expanding the event {v\ ~ ^2} = Uj^^^^Tij we obtain, by inclusion-exclusion, 

0<Si-p; <S2. Here 

Si= p{n, n {di2 = r}) = mPinm n {di2 = r}), 

je[m] 

82= P(^ini^fc) = (™)P(i^inH2) = 0(n-2). 

{j,k}c[m] 



(61) 



We complete the proof of ( 59 ) by showing that 

PCHm n {di2 = r}) = EfA{r)alBln-^/3-^ + o{n~^). 
We denote, for short, x := P{'Hm H {di2 = r}) and write 

X = BPCHm n {di2 = r}) = BP{di2 = r\nm)P{n„,) = EP((ii2 = r\'Hn,)pi^p2m, 

where P denotes the conditionall probability given Ai, A2, Bm, and approximate P{di2 = r|?^m) 
by a Poisson probability fx{r). Indeed, di2 = X^"=3lj is the sum of indicators of events Ci = 
{Di n Di ^ ^, DiCi D2 ^ 0} which are conditionally independent, given Ai, A2, Bm,Tim- Hence, 
for A = (n — 2)p and p = P(£3), Lemma I2I implies 



|P(di2 = r\nm) - fx{r)\ < 2(n - 2)p\ (62) 

Next we evaluate the probability p. We observe that £3 = {wm G -^3} U U Ti^. Denoting 
:= P{wm £ D3) we write 

<p-p^ < p{n^) + Pin^). 

Since p^, < aiBm{nm)^^ /'^ and 

P(?ii) = (m - 1)P(?^1) < AiA2aib^n-^l'^m-^'^, 



(63) 



PCH^) 



P('Hi2) < AiA2a2bln 



we obtain from (63) that 



f < 2,P{n^) + ^P{iP) + 2,pl<c (A1A2 + Brn{n^/^ + nl. 



n 



(64) 



Here c is a constant which does not depend on n and m. In the last step we used the simple 
inequality p^, < + 1*, where := I|^^^„i/4| and 1^, := 1 — I^. Now, (62), (64) imply 

X = Efxir)pimP2m + o(n"^). 



Next, using the inequality |/a(?') — fA{r)\ < |A — A| we replace A by A. We have arrived to (61) 
X = E/A(r)pi„p2m + o{n-'^) = E/A(r)af S^n-^/?"^ + o{n-^). 



We secondly show (60). We denote C = {vi V2} and write 

p; = P{du = r\C)P{C). 



(65) 



By the inequalities < 1 — P(C) < EJ23<j<mP^jP'^j^ second probability P(C) = l + 0{n ^] 



For the first probability we apply the Poisson approximation as in the proof of (59) above. In 
particular. Lemma [2] implies 

\P(di2 = r\C) - fxAr)\ < 2(n - 2)f, (66) 
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where Ai = (n — 2)j5 and p = P(£3|C). Next we evaluate p = P(£3 nC)/P(C). We observe that 
£3 n C = C n IH?. Hence, by inclusion-exclusion, we have Q < Si — P(i23 H C) < 52, where 



5i= PC^stnC) = (™)P(?^i2nC), 

{s,t}cH 

S2 = Y. PCHst n n.y) = (^) + 2(m - 2y' ) = 0(n-3). 

Here the sum J2* I'^s over distinct pairs {{s, t}, {a;, y}} of subsets of [m] of size 2 and 

= P(^i2 n Tisi) = 0(n-^), p" = PCH12 n 7^13) = 0(n-6). 
Therefore, P(£3 nC) = Si + 0{n^^). Furthermore, combining (67) with relations 



(67) 



P(^i2 n C) = Epiip3ip22P32q = ala2bl{nm) ^ + 0{n ^), 
here we denote q = {1 — pi2)(l — P21) YYJLsi^ ~ PijP2j) and use 

< 1 - < P12 + P21 + ^ PljP2j, 

3<j<m 

we obtain Si = 2~^af 02^2^^^^ + 0(n~^). We have shown that 

p = P(£3 n C)/P(C) = 2-^a?a2&2^^"^ + 0(n"^). 



Invoking this expression of p in ( 66 ) we derive (pOl) from ( 65 ) , (166 



Let us prove (12). To this aim we write a = P{B)/P{'D), where P denotes the event {vi 



V3,V2 ~ ^3} and B = V D {vi ~ V2}, and show that 

P{B) = XI + o{n-^), P(P) = XI + X2 + o(n-2) 



(69) 



Here xi := afb^n ^/^m ""^/^ and X2 := 2 ^a\a2b'^n ^. To show the first relation of (69) we 
observe that the event Ti} implies B^ and the event B implies IH} U TH? . In particular, we have 
< P(^) - P{V}) < Pin^). Here 



P(?^^) = ('^)P(1^i23) = 0(n- 
Hence, P{B) = PiU^) + 0{n''^). Next we approximate P{n^) using inclusion-exclusion 

s£[m] {s,t}c['m] s£[m] 

and obtain P{T-l^) = xi + o(n^^). Here we invoke the bound 



(70) 



Y P{nlnnl) = {'^)p{nlnnl) = o{ 

{s,t}c[rn] 



n 



and EseM = "^P(^') = ^1 + 

Let us prove the second relation of (69). We observe that V = TH} U IH?' and approximate 



P{V) W Pi-H}) + P(?^2) ^ ^p(^l) + (™)P(-Hi2) = XI + X2 + o( 



n 
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The rigorous argument is a bit more involved since we operate under minimal moment conditions. 
Introduce event A* = {A^ < n^/^} and its indicator function I = I_4*. We construct upper and 
lower bounds for F(T>) using the inequalities 

p{n^ n A*) + P(^2 n A*) - P{n^ nn'^n A*) < p{v n A*) < p{v) < p{n^) + p{n^). 

By the union bound, the right hand side is bounded by mP(^{) + (™)P(^i2) = xi + X2+o(n^^). 
Next we show a matching lower bound for P(I^). Proceeding as in (70) we write 

Pin^ n A*) = mPinl n A*) + 0{n-^), 

where 

P{n\ n A*) = Epiip2iP3iI = Epiip2iP3i + o{n-^) = alb^inm)-^/^ + o(n-^). 
Hence, we have P{'H^ n .4*) = xi + o(n~^). It remains to show that 

Pin^ nA*)> >i2 + o{n'^), P{'H^n'H'^nA*) = o{n-^). (71) 
For the first inequality we write, by inclusion-exclusion, 

P{n^nA*)> Yl P{'HstnA*)-R={'^)P{'Hi2nA*)-R, 

{s,t}c[m] 

where P('Hi2 H A*) = a^a2&2("-"^)^^ + o{n^^) and where 

R = J2 ^i^st n n..y n A*) = (™) C'f)Ri + 2C^) (m - 2)i?2 = o{n-^). 

Here we invoked the bounds 

i?i = p{nu n n^i n < (nm)-^E(Ai^2)^^*(5i 52^3^4)^ = o(n-3- V"^), 

= P(^i2 n 7^13 n A*) < {nm)-^BAiAlAl{BiB2Bsf = 0{n-'^-^^m-^). 

In the last step we used inequalities A^I < A^n^l'^ and < A^n^l'^. 
Finally, in order to show the second bound of ( 71 ) we apply the union bound 

p{n^r\n^^A*)< ^ p(:h^ r\n,tr\A*) < J]] (r, + n+ ^ 

{s,t}c[m] {s,t}c[m] u<i_\m^{s,t) 

and invoke the bounds 



r, = P(?^J n?^,t n^*) < (nm)-^/2E^^^2^2^3^2 ^ o((„^)-5/2) 
= V{n\ n n A*) < {nmy'^/^BAlAlAlB^.B^Bll = 0{{nm)- 



The following inequality is refered to as LeCam's lemma, see e.g., p3]. 



■7/2^1/4 



□ 



Lemma 2. Let S = Ii + I2 + ■ ■ ■ + In be the sum of independent random indicators with 
probabilities P{Ii = 1) = pi. Let A be Poisson random variable with mean pi + ■ ■ ■ + Pn- The 
total variation distance between the distributions Ps of Pa of S and A 

sup |P(5g A)-P(Ag A)| <2Vp?. (72) 

AC{0,1,2...} 
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Lemma 3. ([7]) Given integers 1 < s < di < d2 < m, let Di,D2 be independent random 
subsets of the set W = {1, . . . , m} such that Di (respectively D2) is uniformly distributed in the 
class of subsets of W of size di (respectively d2)- The probabilities p' := P(|-Di H D2I = s) and 
p" := P(|Di n L>2| > s) satisfy 

( {di - s){d2 - s) \ , 
Here we denote p*,^^,^^, = {'^) i'^^) {^)-\ 
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